Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Voice-Text Multimodal
# Voice-Text Multimodal
Ultravox V0 5 Llama 3 2 1b
MIT
Ultravox is a multimodal voice large language model based on Llama3.2-1B and Whisper-large-v3, capable of processing both voice and text inputs.
Text-to-Audio
Transformers
Supports Multiple Languages
U
fixie-ai
167.25k
21
Featured Recommended AI Models
Empowering the Future, Your AI Solution Knowledge Base
English
简体中文
繁體中文
にほんご
© 2025
AIbase